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ABSTRACT 

Classification  of  information  seems  to  offer  a more  logical 
arranromont  of  meanings  and  associations  than  dors  fSipn“:*r:  1.4. 
indexing.  However,  fifty  years  of  li.bri.ry  experience  with  classi- 
fication systems  indicates  that  classification  is  useful  only 
within  the  limited  fields  of  the  taxonomic  sciences. 

The  recent  rebirth  in  in to  jest  in  general  classification 
systcinfl  is  t-rtcsd  to  csrtftir;  j,rcblsms  rclsti/'.p  to  ths  s’.scbl  r:S 
storage  and  retrieval  of  information}  and  it.  i3  shown  that  this 
new  interest,  together  with  the  new  terminology  of  "abstraction 
ladders,"  "semantic  factoring"  and  "categorization,"  offers  little 
promise  of  solving  the  inherent  difficulties  of  hierarchical 
classification.  It  is  concluded  that  classification  remains  a 
"blind  alley"  and  that  other  techniques  and  principles  of  as- 
sociating meanings  must  be  found  and  developed. 


MACHINES  AND  CLASSIFICATION  IN  THE  ORGANIZATION  OF  INFORMATION 


PART  I 


In  previous  papers  we  have  recognized  that  cl&ssl* 
.flea tier*,  a*  e^ntrast^li  with  aj. nhohr; t.1  C~  1 sijhlnr.t*  headings 
or  coordinate  indexing  supplies  a kind  of  connect! veneao 
between  ideas  ani  provides  the  possibility  of  "browsing’' 
through  related  ideas. 


"Alphabetical  arrangement  ia  most  convenient  for 
the  user  who  can  precisely  name  the  subject  of 
his  3oarch  in  tho  same  terminology  as  used  by 
this  indexer,  but  may  make  searching  difficult 
for  These  users  vho  s.rs  not 

with  the  fashions  in  nomenclature,  who  are  not 
completely  familiar  with  the  subject  of  search 
who  have  only  a vague  question  in  mind,  etc., 
can  be  helped  by  a system  whose  arrangement  is 
related  to  the  organization  of  the  field  searched." 


"Classification  can  arrange  ideas,  not  merely  words, 
since  moaning  can  bo  Indicated  through  position, 
as  Well  as  phraseology  where  terminology  is  not 
fixed.  Browsing  among  related  concepts  i3  of 
courne,  facilitated  by  placing  them  in  proximity."^ 


Tho  degree  to  which  any  classification  system  as- 


sociates ideas  1&  a measure  of  tho  effectiveness  of 


^'Studies  ir,  Coordinate  Indexi  ng,  pp.  60-69. 
Tncor po ratecTi  V'ashing ton  , D , C - (19^3). 
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the  system.  Where  classification  fails  is  ir,  the  arbitrary 
disaasociations  which  are  imposed  on  related  ideas  by  the  re- 
quirements of  the  system,  These  arbitrary  disassoci* lions  ar* 
hidden  and  the  searcher  who  is  content  with  the  relationships 
displayed  in  the  system  will  obtain  only  partial  Information, 

In  short,  for  rsasens  which  will  bo  set  forth  in  detail 
below,  classification  systems  are  not  truly  effective  instru- 
ments for  displaying  to  ins  browser  or  searcher  all  the  ideas 
in  any  system  which  are  associated  with  any  givan  idea  with 
which  the  system  is  entered. 

This  fact  was  recognised  by  Dr.  V*nr»var  dush  in  the 
2 

n.otation  which  posed  the  general  problem  ef  tn®  assoc!- sti on 
of  ideas  as  presented  in  our  previous  report. 

In  spite  of  all  the  theoretical  and  practl.-.i  objections 
which  can  be  marshalled  against  classification  as  a method  for 

a 

organizing  information,  classifies  tier,  systems  are  apparently 
successful  in  associating  ideas  torn  asunder  by  alphabetical 
indexes.  This  leads  many  Individuals  and  orraiiizations  to  ever 
new  attempts  to  devise  class  i cation  systems,  These  attempts 
have  f.iultiplied  in  recent  years,  not  because  of  any  actual 
successes  on  the  record,  or  any  now  developments  in  classifi- 
cation theory.  They  have  multiplied  because  the  machine  search- 
ing of  any  considerable  body  of  information  seems  to  require 
2 

"Our  ineptitude  in  getting  at  the  record  id  largely  caused  by 
the  artifir lily  of  systems. ., and  information  is  found  (when 
it  is)  b/  tracing  it  down  from  *ubrlA«*s  to  ?ubcla'?9," 


that  tho  information  be  pre-arranged  in  a classification  system. 

The  classification  of  knowledge  in  the  bread  philosophical 
sense  is  as  old  as  self-conscious  knowledge  itself}  but  the 
classification  of  books,  items  of  literature,  or  items  of  in- 
formation is  a product  of  the  nineteenth  century.  It  is  cus- 
tomary to  explain  the  adoption  of  classification  systems  by 
libraries  in  cerms  of  cne  growtn  of  tho  open-shelf  system  of 
public  and  college  libraries  which  occurred  during  the  nine- 
teenth century.  If  books  are  to  be  displayed  for  patrons  to 
do  their  own  browsing  and  make  their  own  selection,  tne  books 
must  be  arranged  in  some  generalized  subject  order,  e.g., 
science,  religion,  fiction,  history,  hobbies,  etc. 

This  explanation  doss  not  illuminate  the  jump  from  gen- 
eralized shelf  arrangement  to  universal  systems  involving  close 
and  detailed  classification  of  each  book  or  of  each  item  of  in- 
formations i?omj  other  explanation  is  noseted  to  acc yi>'iu  its  tne 
tremendous  emphasis  upon  classification  systems  at  the  turn  of 
the  nineteenth  and  twentieth  centuries.  In  & few  generations, 
we  have  witnessed  the  development  of  the  Dewey  system,  the 
Library  of  Congress  system,  the  Cutter  expansive  system,  the 
Universal  Decimal  Classification,  the  Brown  Classification,  tho 
Col.ii.  ;’l-  i lion,  the  Bliss  Classification,  tho  U.  S» 

Patent  Office  Classification,  and  « host  of  others  which  were 
born  and  have  died  in  som.*»  local  library  cr  information  center. 


« «• 

The  explanation  for  this  phenomenon  is  to  be  found  in  the 
temper  of  the  times.  The  nineteenth  century  was  the  age  of 
biology  in  the  sense  in  which  the  seventeenth  century  was  the 
age  of  physical  science  and  the  eighteenth  century  was  the  age 
of  reason  and  enlightenment.  This  is  not  to  say  that  thsre  was 
not  important  work  in  the  physical  sciences  in  the  nineteenth 
century.  Maxwell.  Faraday.  rfYihh*.  peanOj  Frege,  Gauss ; tffcimholt* 
and  dozens  of  other  important  chemists,  mathematicians,  and 
physicists  lived  and  worked  during  this  period,  and  there  were 
important  diseovnrias  in  all  fields  of  science,  But  the  ideas 
which  were  generalised  beyond  the  laboratory  and  which  established 
tha  intellectual  climate  of  the  age  came  from  the  science  of 
biology.  From  biological  anelogs  came  the  ideas  of  social  evo- 
lution, the  claas  struggle,  survival  of  the  fittest,  the  wnite 
Usui's  burden,  manifest  destiny,  and  the  iron  law  of  wages.  Since 
biology  is  a taxonomic  science,,  a science  of  classes,  it  is  reason 
able  to  expert  that  librarians  and  other  systems tizers  should  em- 
ploy the  biologies!  notions  of  taxonomy  and  hierarchical  classes 
to  organize  their  books  and  itemfc  of  information. 

Therms  builders  of  classification  systems  might  hawe  built 
better  systems  had  they  the  wit  or  wisdom  to  percei.s  that  their 
own  atru-'turco  were  products  of  an  evolution  which  would  in  turn 
destroy  them.  Charles  Sanders  Pierce,  who  was  a mathematician 
and  one  of  the  great  creators  of  modern  logic,  and  not  a biologist 
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warned  the  system  builders  of  his  day  that  the  better  they  built 
for  the  day,  the  shorter  would  be  the  life  of  what  they  built*3 
And  Nietzsche,  who  took  evolution  seriously,  recognized  that  all 
the  ’-’aiues  of  his  day  must  be  superseded)  that  tha  superman  must 
follow  man  in  the  chain  of  evolution.  This  insight,  and  perhaps 
a few  biological  complications,  drove  him  mad. 

For  it  is  certainly  the  cream  of  tha  jest  that  this  age 
of  biology  was  also  in  fact  the  age  of  Victorian  smugness, 
f.volution  explained  all  development  and  all  error  up  to  the 
status  quo,  and  some  way,  some  how,  evolution  was  supposed  to 
cease  in  its  highest  product,  nineteenth  century  Europe. 

In  1937,  writing  on  this  same  topic  in  collaboration  vita 
Dr.  John  Lund,  we  summed  up  the  situation  in  these  words i 


KThr  nineteenth  century  had  an  abiding  faith  in  the 
permanence  of  its  values  end  the  ultimate  validity 
of  its  scientific  structures.  This  is  illustrated 
by  the  belief  of  svstematiaers  that,  once  a good 
classification  of  knowledge  was  achieved,  it  would 
bo  permanent.  They  did  not  learn  from  the  fate  of 
previous  systems  that-  their  own  must  of  necessity 
become  obsolete. 

'Decimal  classification  was  born  in  a 
period  when  mankind  had  full  confidence  in 
the  all -might!  ness  of  materialistic  wisdom. 

The  middle  of  the  nineteenth  century  was  the 
cuiminucion  point  of  scientific  positivism. 

It  seemed  that  the  totality  of  available 
knowledge  as  well  of  future  knowledge  f*ould 
be  arranged  in  a simple  predetermined  plan. 
Forgotten  was  the  word  of  wisdom  that  Hamlet 
to  Horatio  spoke. ’ "5 

^Collected  papers  of  Ch&rlen  banders  Pierce,,  p.8j.  edited  by 
"Charles  hartshorns  and  Paul  h'eiss,  rfarvaru  University  Press, 
dambridge* 

^The  i.i  hmrv  On»rti*r'’  v.  7 . 

>_  .r  -■  .•'f.-j-r  * • 

5nr.  Conker  Duyvis.  in  an  address  given  before  tne  British 
Society  of  Bibliography*  Reported  in  "Notes  and  nows,"  p.?i»3. 


Lest  it  be  supposed  that  we  were  unfair  then  end  unfair 
now  in  our  estimate  of  the  temper  of  the  times,  we  quote  be- 
low from  the  annual  Register  of  thG  University  of  Chicago  for 
1902.  In  the  announcement  of  courses  for  the  Department  of 
Physics  for  that  year,  students  electing  physics  were  told: 

'"n'iiilo  it  i3  never  safe  to  affirm  that  the  future 
of  Physical  Science  has  nc  marvels  in  store  even 
more  astonishing  than  those  of  the  past-,  it  seems 
probable  that  most  of  the  grand  underlying  princi- 
ples have  been  firmly  establinhod , nrH  that-  further 
advances  «u-e  to  be  sought  chiefly  in  the  rigorous 
aupiication  of  these  principles  to  all  the  phenomena 
which  coma  under  our  notice. 

It  is  here  tnat  the  science  of  measurement  shows 
its  importance  where  quantitative  results  axe  more 
to  be  desired  than  qualitative  work.  An  eminent 
physicist  has  remarked  that  the  future  truths  of 
Physical  S dsneo  to  be  looked  for  in  the  sixth 
pi«uc  uf  decimals."'' 

The  chairman  of  the  Department  of  Physics  at  that  time 
»»a  Professor  A.  a.  Hicmuson  of  the  lamed  Micheieon-Morely 
experiment  whose  implications  already  threatened  the  stability 
of  the  "grand  underlying  principles  /so  established, ” 

Is  it  any  wonder,  then,  that  "mere"  librarians  should  de- 
lude themselres  into  thinking  that  they  could  classify  all 
knowledge  for  all  time?  Dewey’s  proud  boast  that  his  classi- 
fication system  could  include  any  new  developments  in  knowledge 
through  the  device  of  adding  another  digit  after  the  decimal 
point,  is  exactly  on  a par  with  the  assertion  that  new  develop- 
ments in  physical  science  would  consist  of  refinements  of  measurement 


"Annual  Register  of  the  University  of  Chicago,  p.  292  (1901-1502), 
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A few  years  ago,  It  could  be  said  with  considerable 
assurance  that  classification  was  a dead  issue  ss  far  as 
librarianship  and  documentation  were  concerned.  More  and  more 
librarians  and  scientists  had  come  to  depend  on  alphabetical 
subject-heading  systems  end  *'j>h*betical  indexes.  The  excessive 
preoccupation  of  the  FID  with  the  Universal  Decimal  Classi- 
fication was  largely  responsible  for  its  lack  of  influenza 

m 

and  effectiveness  in  practical  librarianship  and  documentation.' 
Libraries  already  committed  to  various  classification  systems 
had  C'  me  to  regard  such  systems  as  devices  for  shelf  natation 
and  not  as  usable  and  viabla  keys  to  the  subject  content  of 
their  collections. 

A few  names,  Bliss  and  Rangar.a1  r±r.t  a few  libraries,  the  John 
Cr«rar  Library  in  Chicago  and  the  Engineering  Societies  Library 
in  New  -’ork  retained  throughout  the  first  half  of  the  present 
century  a practical  interest  in  problems  of  classification.  But 
the  general  feeling  on  matters  of  classification  has  been  veil 
summed  up  by  Dean  Jesse  Sher*  of  Western  Reserve  University 
School  of  Library  Science  in  the  following  passages! 

"Today,  under  the  impact  of  a rapidly  growing  volume 
of  graphic  records,  and  the  appearance  of  new  forms 
of  publication,  traditional  library  classifications 
are  becoming  hopelessly  inadequate.  No  amount  of 
basic  revision  or  tampering  with  their  organic  structure 
can  save  them  from  this  failure.  As  guides  to  the 

7 

Bibliographical  Services i Their  Present  State  and  Possibilities 
of  ImprovemenC  XppendEc,  p.  12  (1^6;."  IneUflEScC” Library 

jfji  U S J L J - S « 

v.  wii(iogs  survey. 
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subject  content  of  the  library  they  ere  essentially 
meaningless,  Even  librarians,  who  are  best  qualified 
to  interpret  them  and  to  exploit  their  virtues,  use 
the  notation  only  as  a guide  to  location,  and  largely 
ignore  the  interdisciplinary  relationships  that  they 
were  designed  to  reveal.  Yet,  as  their  efficiency  has 
declined,  the  cost  of  their  maintenance  has  increased 
until  at  least  one  major  research  library  has  abandoned 
subject  classification  cf  its  book  stocks  and  has  turned 
to  other  and„rnorc  promising  forms  of  bibliographic  or- 
ganization, 

"The  history  of  library  classification,  then,  has  been 
the  narrative  of  a pursuit  of  impossible  goals,  and  it* 
pages  are  strewn  with,  the  wreckage  of  those  who  sithsr 
were  blissfully  unaware  of  the  dangers  by  which  their 
paths  were  beset,  or  who  hoped  to  circumvent  them 
through  mere  modification  of  previous  schematisms  or 
simple  tinkering  with  notation.  Today  the  essential 
failure  of  traditional  library  classifications  i3  nc 
more  real  than  it  was  three-auartera  of  a century  ago, 
but  it  has  become  more  apparent  because  of  the  in- 
creasing bulk  und  complexity  cf  the  materials  that 
libraries  are  being  called  upon  ic  service,  and  tha 
growing  specialization  cf  the  demands  that  librarians 
are  being  asked  to  meet,"? 


"Classification  as  the  Basic  of  Bibliographic  Organisation" 
in  bibliographic  Organization,  Papers  Presented  before  the 
Flf  teer.th  Annual  CoTiTercnco  of  the  Q ra3uaie~TT5rary  School, 
July  P ■»  7r~  EdTcea”T-y~ Jesse  M.  Shera  and 

Fargaret  E,  Egan,  Chicago,  The  University  of  Chicago  Precis, 
( i oci  \ 

$ 

"Glassification:  Current  Functions  and  Applications  to  be 
the  Subject-  Analysis  of  Library  Materials, " in  The  Subject 
Analysis  of  Library  Iinte rials,  p.  }2 , Edited  by^TiauriccTE, 
TauKer,  'Softool  o'f  Lilrary'Service.,  Colunoia  University, 

New  York  (1953K 


A 
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How  then  do  we  Account  for  the  renewed  interest  in  classi- 
fication as  a method  of  information  control?  Within  the  last 
.few  years,  we  have  witnessed  t ..u  birth  (and  in  some  cases,  the 
rapid  death)  of  dozers  of  new  classification  systems,  among 
which  we  can  name-,  The  Story  Classification  for  the  Army 
Technical  Reference  Service}  the  Office  of  Naval  Research 
Project  Status  Classification}  the  Research  und  Development 
Board  Cl*A?ifi**»tior5  of  research  project; , the  American 
Society  for  Metals  - Special  Libraries  Association  Metallur- 
gical Literature  Classification,  ana  the  Standard  Aeronautical 
Indexing  System,10  There  has  been  a revival  of  interest  in  the 
Universal  Decimal  Classification,  in  the  Patent  Office  Classi- 
fication, ana  in  Kanganthan’s  Colon  Classification,  A research 
project  supported  by  Federal  funds  has  labored  fer  several 
years  and  is  still  laboring  on  the  development  of  "abstraction 
ladders"  and  "semantic  factoring". 

This  renewed  search  for  the  solution  to  an  unsol vahie 
problem  results  from  a paradox,  nAmely,  the  promise  of  machine 
organization  and  retrieval  of  information,  and  t.he  actual 
aluMiwDB  of  the  mac nine  in  tne  linear  saarching  of  an  index. 

As  we  shall  see  in  the  following  discussion,  classification 
becomes  one  of  the  methods  proposed  for  dividing  an  index  in 
order  to  shorten  the  time  requii-e-i  for  e maenme  search, 

10 

In  spite  of  the  name,  the  Standard  Aeronautical  Indexing 

System  is  a Hierarchical  classification  system. 
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PART  II 


Let  us  suppose  we  are  searching  for  the  name  "Baker. 

Abie  Charlie"  in  a village  telephone  book  containing  about 
1COO  names.  To  search  for  this  name  might  take  a minute  or 
two*  occupied  with  picking  up  the  book,  finding  the  proper 
page  and  column,  and  scanning  the  proper  column  for  the  name 
being  sought.  Now  It-  is  quite  practical  to  utilize  an  iSn 
macmne,  or  some  similar  machine,  or  even  a deck  of  edge- 
notched  cards,  tc  find  one  name  in  a random  file  of  a thousand 
names,  in  about  the  same  time  required  for  the  manual  search 
of  an  alphabetical  file  in  a minute  or  two.  But  suppose  we 
are  looking  for  the  name  "Baker,  Able  Charlie"  In  a liBt  of 
a million  names  comparable  to  the  New  fork  telephone  book. 

It  might  take  us  a little  longer  tc  lift  the  heavier  book,  to 
find  the  right  page  and  the  right  column,  and  to  scan  by  the 
given  names  and  address  as  well  sa  the  last  name.  Nevertheless, 
the  time  required  for  a search  for  one  name  in  a alphabetical 
list  of  a million  names  is  of  the  same  order  of  magnitude  as 
the  time  reouired  to  find  one  name  in  an  alphabetical  list  of 
a thousand  names.  But  a machine  search  for  one  name  in  a 
random  list  cf  a million  names  will  take  one  thousand  times 


as  long  as  a machine  search  for  one  name  in  a thousand. 


It  was  tho  more  or  loss  va»"us  realization  of  this  fact 
that  led  the  early  advocates  of  the  application  of  punched-cnrd 


I 
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machines  for  the  organ! esiion  and  the  retrieval  of  inf ormation 
to  recognise  that  machine  methods  could  not  be  applied  efficient- 
ly to  the  random  searching  of  large  masses  of  information*  No 
machine  search  of  a large  random  list  can  approach  the  speed 
with  which  the  mind  can  jump  to  the  exact  position  in  an  ordered 
list.  It  would  be  silly  to  randomise  a list  of  names  in  a phone 
bockj  or  subject  hoA-dlngs  In  an  alphabetical  index , in  order  to 
search  for  any  particular  name  or  heading  vith  p'.mchad=card 
machines.  An  ordered  list  when  it  is  over  a certain  else  al- 
ways enables  the  Aind  which  recognises  and  utilises  the  order 
to  beet  the  machine.  The  conclusion  to  be  drawn  here  is  that 


contrary  to  popular  ml  seonceptinna ; the  larger  the  number  of 

mini  i fafidol  w A*  + n«4  ^a  4 « 14na««>  A--  ■*  — *>- — 

--- — t-.-.  1 --  v..w  ww  «•  •>»•»«.«•  win  va  amwi hms wavii j 

the  less  applicable  are  standard  pu.xhed-card  systems  or  even 
magnetic  tap*  systems  to  the  problem  oi  searching;  and  this 
conclusion  leads,  in  turn  to  a search  for  1)  weya  to  cut  down 
the  sise  of  indexes  and  2)  ways  to  prefils  or  classify  items 
of  information. 

The  extent  to  which  coordinate  indexing  cuts  down  the 
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of  all  permutations  of  te  rrns  in  the  indexing  system,  scorns  to 


offer  the  promise  of  efficient  use  of  machine  methods.  Consider 
for  example,  a collection  of  250,000  items  to  be  organized  in  a 
system  of  information  storage  and  retrieval.  The  items  might 


i 

i 

i 


bs  Anything  - documents,  reports,  patents , film  foot&ge  or 
iter.8  of  hardware  in  a supply  catalog.  With  standard  indexing 
systems,  the  site  of  the  index  would  bs  250, 000  times  the  number 
of  ways  each  item  was  indexed.  Let  us  assume  that  an  average 
of  four  terms  is  required  to  properly  indexing  or  identify  eech 
item-.  The  permutation  of  it  is  ih,  and  this  gives  us  a maximum 
figure  of  6,000,000,000  headings  in  the  index.  No  index  ever 
attempts  to  use  all  possible  permutations  of  its  terms  as  head- 
ings | but  a barely  adequate  index  in  which  each  item  is  in- 

* 

coxed  by  four  terms  will  have  at  least  four  times  the  number 
of  index  headings  aa  items.  In  this  case,  there  would  be 
1,000,000  indexing  headings  in  order  to  insure  that  each  term 
need  ir.  the  index  will  to  in  a filing  position  and  will  be 
found  in  proper  alphabetical  order,  A punched- card  system  which 
could  utilise  ens  card  per  document  end  enter  ail  four  indexing 
terms  of  a document  on  this  one  card  would  require  only  250,000 
cards  for  an  adequate  index.  The  same  reasoning  can  be  applied 
to  edge-notched  card  systems.  Hence,  it  is  true  that  the  use 
of  machine  methods  can  reduce  an  index  of  250,000  items  from 
250,000  x n (where  n is  the  muster  of  index  entires  per  item) 
to  250,000  by  eliminating  the  usual  requirement  of  preparing 
multiple  entries  for  each  itsm.  The  elimination  6?  multiple 
entries  follow*  from  the  capacity  cf  machine  systers  to  searen 
for  i\n  item  under  any  word  which  indexes  it  and  to  combine  all 
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such  words  for  an  item  on  one  curd.  We  can  see,  therefore, 
that  for  systems  in  which  n is  large  and  the  number  of  items 
is  SRtsiXi  puncned-cara  “.aciilneo  and  edge«notcuou  cards  do  offer 
the  promise  of  a considerable  reduction  in  the  size  of  such 
systems.  But  note  that  the  siae  in  any  instance  canno«  be 
smaller  than  the  number  of  items  in  the  system.  This  means 
that  when  a number  of  units  ia  large,  e.g.,  patent  files  , 1 oems 
of  supply,  intelligonce  reports,  scientific  and  technical  reports 
case  records  in  a large  hospital,  etc.,  ordinary  systems  of 
machine  organization  and  retrieval  are  not  practical:  The 


random  search  of  1*000*000  items  bv  standard  ounchsd-card 

11 

systems  will  take  about  33  hours  per  single  search. 

Surely  uw  time  required  to  find  .n  item  in  she  standara 
alphabetical  index  of  a million  items  or  ever.  10  million  is 
only  a fraction  of  this  time.  Of  course,  ar.  expenditure  of 
time  is  required  to  set  up  and  maintain  alphabetical  indexes, 
but  even  if  this  time  is  more  than  that  required  to  punch  a 
set  of  cards  to  be  maintained  in  random  order,  if  any  appreciable 
use  is  made  of  a random  system,  the  great  excess  of  time  for 


9±yta 


if  any,  realized,  in  setting  up  the  system.  Finally,  although 
some  systems  claim  the  possibility  of  asking  multiple  questions 


•‘"‘This  figure  is  based  on  the  ability  of  certain  experimental 
I&M  equipment  to  scan  an  entire  cerd  at  a rate  of  500  cards 
per  minute*  Standard,  commercially  available  IBM  equipment 
which  sorts  and  selects  a column  at  a time  would  require  33 
hours  for  the  first  searoh  oy  the  first  column.  Selection 
by  the  second,  third  eolu.r<n.  etc.,  would  require  an  additional 
time,  determined  by  the  number  ox'  cards  eliminated  st  each 
stop  of  the  search. 
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ciufj.ns  a b ingift  BftftT*cW  • the  total  system  is  searched  for  any 
question  or  questions  and  is  unavailable  for  consultation  by 
any  other  searcher.  The  conclusion  to  be  drawn  from  the. 
considerations  outlined  above  la  that  the  reduction  in  the 
siae  of  a file  made  possible  by  coordinate  indexing  does  not 
in  itself  establish  the  practicability  of  punchcd-card  search- 
ing of  large  systems  * Some  more  drastic  reduction  in  sise  is 
required. 

If  instead  of  searching  the  filo  we  collate  or  coordinate 
terns,  we  can  enormously  reduce  the  time  of  searching,  but  we 
will  have  to  pay  for  this  reduction  in  searching  tima  by  a 
compensatory  increase  in  the  alae  of  the  file*  Consider  the 
following  arrays  in  which  the  letters  represent,  ideas  or  terms 
in  the  index  and  the  numbers  represent  the  items  to  be  indexed. 
Let  us  assume,  as  we  did  above,  that  each  item  is  indexed  by 
four  terms o 


Searching 

1 AHNO 

2 B C D T 

3 A B M R 

l*  L N 0 F 

5 C G H K 

6 POMP 

7 LPRT 

6 H K L S 

9 B C R S 

etc. 


A 1 3 

B 2 3 9 

C 2 5 

D 2 

P 6 

0 5 6 8 

H 5 

K 5 8 

L i*  ?8 
Mil  6 

HI  u 

Oil* 

P 1*  6 7 


R 3 7 9 

S 8? 

T 2 7 

etc,. 

In  tho  array  labelled  Searchi  ng.  th«  nine  items  sr/i  thirty 


six  indexing  terms  con  be  recorded  on  rine  cords,  A search  for 
any  item  indexed  under  tho  term  "G1'  , or  any  combination  of 
terms  "LP"  , necessitates  a search  of  the  total  file,  in  this 
ease  nine  cards.  But,  if  tho  array  wore  continued  on  through 
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& million  items  the  search  for  "0"  or  WLPM  would  still  involve 
the  examination  of  the  total  file  of  1,000,000,  If  we  turn  to 
the  array  labelled  Collation,  we  note  first  that  each  of  our 
nine  numbers  is  repeated  four  times,  a total  of  36*  This  means 
that  36  punched  cards  (the  number  of  item?  x n)  rather  than 
nine  are  needed.  Now  if  we  desire  ail  items  indexed  with  the 


tarn  "n",  no  searching  is  required;  the  array  shows  us  that 
item  5 and  6 are  under  "Qn,  if  we  wish  all  item*  indexed 


under  Lr,  we  are  not  required  to  search  the  whole  file,  but 
only  to  compare  (collate  the  numbers  recorded)  under  and 
Pn  1 e.g. , L It*?-?  * ,B  i*— 6-7-.  It  is  apparent  that  items  u and 
7 are  indexed  under  " LP". 
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•eesi  to  Indicate  that  there  arc  rwre  terms  than  documents.  For 
any  large  system  of  information  the  reverse  is  true;  there  are 
always  less  terms  then  items  in  any  systsm  of  information  large 
enough  to  raouire  any  organisation  at  all.  Even  in  this  small 
sample , the  number  of  items  under  any  terra  is  less  than  the 
total  number  of  items. 

Ths  number  of  cards  required  for  sotting  up  a collating 
eystem  for  250,000  items  is  again  equal  to  250,000  x n (where 
n is  the  average  number  of  terms  used  to  index  any  item). 

Where  n*U,  wo  navs  a million  cards  just  as  we  did  in  our 
standard  alphabetical  file.  In  collation,  however,  the  cards 


are  not  mr.intniriod  hi  a single  array;  instead  wo  vri  LI  have  a3 
many  arrays  as  there  a no  Jiff*,  runt  bormj  uueii  in  the  system. 


Thus i if  10,000  different  terms  are  used  in  indexing  250,000 


documents  we  will  hnve  10,000  arrays  of  cards.  Each  array 

i 

will  uion  contain  250,000  x h cards  or  100,  The  total  number 
' 1<VX>0 

of  cards  will  be  the  number  of  arrays  times  the  number  of  cards 
in  each  arrayi  10,000  > IOC  * 1,000,000. 

The  process  of  collating  for  any  item  or  i torr,5  i nuC  xed  by 
any  four  terms  i»j  this  system  will  involve  t-ho  collation  of 
UOO  cards  (four  arrays  of  100  each).  Sirce,  in  a file  organized 
for  selection  by  searching,  Ke  will  have  to  search  250,000  cards, 
ws  can  conclude  that  whereas  we  multiplied  cur  file  b/  h,  in 


order  to  shift  from  searching  to  collating,  we  cut  machine  time 

by  a factor  cf  250,0-00  or  625.  On  the  other  hand,  a punched- 
“),00~ 

card  file  for  collating  must  be  maintained  in  a fixed  order. 

It  is  not  possible  to  collate  two  random  filen  by  standard 


machine  methods. 


The  superiority  cx  collation  to  searching  ae  a machino 
method  for  making  selections  from  large  systems  does  not 
materially  advance  us.  A fils  used  for  collation  equals  in 
size  n standard  index  n n d must  exhibit  the  sainn  type  of  rigorous 
order,  f urthormore , collating  i 3 a relatively  slow  machine 
proceri3,  and  colLa ti up  four  array;;  Involves  three  machine  runs. 
There  is  r.o  evidence  that  such  an  opi-raticn  is  less  time-consuni  n 


and  more  efficient  then  searching  for  the  proper  heading  by 
mind,  sys,  and  hand  in  a regular  index. 

We  are,  of  course)  assuming  that  an  item  indexed  by  four 
items  will  require  and  receive  only  four  entries  in  a standard 
index.  If.  in  the  standard  index,  we  wish  to  provide  for  the 
other  possible  permutations  of  terms  in  an  alphabetical  sequence, 
we  must  increase  the  size  of  the  file.  In  a collation  system, 
we  provide  for  all  possible  permutations  with  * maximum  number 
of  cards  equal  to  the  product  of  the  number  of  items  and  the 
average  number  of  terms  us ad  to  index  eaon  item.  Using  cur 
figures  of  250,000  items  and  h terms,  the  difference  here  can 
be  expressed  by  means  of  the  following  equations i 

Collation:  25‘n;npo  x k ■ All  possible  pei-mui-sticr.?.- 

Standard  Alphabetical  Systems!  250,000  x 2Jj  ■ All 
possible  permutations. 

This  capacity  to  provide  for  all  permutations  without  in- 
creasing the  size  of  the  file  constitutes  a definite  advantage 
of  machine  systems  over  standard  alphabetical  indexes.  But 
this  advantage  is  only  aigrdf leant  when  the  number  of  desired 
permutations  is  large  and  the  number  of  units  indexed  is  small. 

From  these  considerations,  the  conclusion  has  been  generally 
drawn  that  the  linear  machine  "C  tLl  G f ang  of  an  alphabetical  in- 
dex is  not  a practical  alternative  to  established  manual  methods. 
It  is  to  escape  thi-.  conclusion  that  once  again  attempts  are 
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being  made  to  develop  classification  systems.  A classification 
system  which  provide?  a hierarchical  sot-  of  classes  and  sub- 
classes presumably  makes  it  possible-  to  search  for  any  item 
in  a class  or  subclass,  rather  than  in  a total  system.  Con- 

34  rJ«»n  nv  nwrvl  a i P r 

amv*  I i.  v»  w Aumoxu  | tuu  t c 

which  contains  upwards  of  ? ,500.000  patents  in  over  300  main 
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cxt&cssvn  Hii.a  mm f % "-a.*  ^uucia33^fl*  suppose  mdL  RHcn  j;awru«  couia 

be  uniquely  classified  and  that  a search  for  a group  of  analogous 
patents  could  be  restricted  to  a search  of  a single  class.  We 
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«ach  of  these  .files  malts 
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cf  lfi  minutes,  whereas  it  would  take  over  SO  hours  to  searc. 
all  tern  rnH"..  Tf  r-rjrvla  were  nrpfiied  in  !,-0..(?o0  ihrin^neA. 
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it  would  take  less  than  a minute  to  find  any  3inglr  patent  or 
group  of  patents  in  the  same  subclass. 

Unfortunately,  an  is  generally  recognized,  the  Patent. 
Office  Classification  does  not  accomplish  the  unique  classi- 
fication cf  analogous  patents.  Recently  we  made  a test  search 
for  patents  on  "aircraft  de-icing  equipment."  Kven  though  we 
found  k subclass  "airciaft-jce  removing  equipment"  under  the 
main  class  "aircraft,"  much  of  the  analogous  art  was  found 
through  crocs  references  to  be  it.  the  other  classes,  namely! 
'"riu.’it  engine:;",  "pumps",  "vibrators",  etc. 


The  Tact  that  any  search  in  the  Patent  Office  Classi- 
fication may  involve  a dcaen  or  more  classs  s and  subclasses, 
indicates  that  this  classification  will  not  provide  the 
unique  location  and  mutually  exclusive  classes  required  for 
cutting  down  the  time  of  machine  searching.  This  conclusion 
is  to  be  expected  since  the  Patent  Office  Classification  was 
not  developed  for  use  with  machines. 
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devise  classification  system?  designed  especially  for  machine 
searching.  In  this  connection  we  will  consider  briefly  Dr. 
Story’s  ^Proposed  Classification  List  for  tne  Army  Technical 
Reference  Service" ~ and  wr.  James  Perry's  work  with  "ab- 
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cwavuxvm  iauuoio  auu  ocmanuxC  i«Cvui  j.ng  » 

Dr«  Story  attempted  to  construct  a classification  which 
would  permit  the  assignment  of  one  class  number  to  a report, 
but  in  his  own  test  he  assigned  an  average  of  l.?3i>  class 
numbers  per  report thus  admitting  his  attempt  at  creating 
mutually  exclusive  class  numbers  was  unsuccessful^  Further- 


13 


12 


mstoTi&i  which  follows  is  based  oo  an  adminis tritive  re- 


13 


port  by  t.  u.  Qull  to  the  Armed  Services 
Agency. 


rachnical  Inform*,  iior. 


His  rule  for  classifying  is  quote  on  p.  ?U~2$  of  4 report, 
"Analysis  of  tho  Proposed  Classification  uiet  for  ih?  Army 
Technical  Reference  Service,  August,  lpl*9"5  prepared  by 
Documentation  Incorporated  in  December  1952,  for  ths  Armed 
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woro,  he  recognised  that  some  subjects  a.ro  common  to  more  than 
cr.o  main  class,  and  provided  three  tables  of  numbers  for  those 
subjects ,x^  many  of  which  are  not  duplicated  in  the  main  desses. 
All  of  the  numbers  were  recorded  on  IBM  punched  cards,  and  the 
cards  arranged  by  main  classes,  which  threw  the  numbers  from 
the  tables  out  of  order.  As  a result,  a search  for  any  number 
from  the  three  tables  required  a search  of  ull  the  punched  cards, 
and  thus  the  attempt  at  reducing  the  number  cf  cards  to  be  searched 
by  using  u classification  was  vitiated  for  the  tables  as  well  as 


Published  reports  are  not  available  from  which  we  can 
evaluate  fully  the  work  of  Mr,  i erry  and  his  colleagues.  Enough 
has  been  said  publicly  to  permit  us  to  offer  & preliminary  esti- 
mate of  the  contribution  of  the  concepts  of  "abstraction  ladders" 
and  "semantic  factoring",  to  & solution  to  this  problem. 

*.n  "abstraction  ladder1  as  originally  proposed  by  hr. 

Perry  was  a highly  doacriptivo  name  for  the  ^oxonomic  relation- 
ship of  clnsposi 


3 

Phylum 

Chordata 

32 

Class 

Mammalia 

327 

Order 

Carnivora 

3271 

Fa ml ly 

Cnnidao 

327 iiB 

Genus 

!)c  R 

3?7h?3 

Specie a 

Lnbrndov  1 

''o 

nr  ret c-: 

A I runic 

('  * '• 


Th*  ny»be?«  at  the  lift  of  th*  tablt  can  fc*  ©enaidarsd  oea?; 

designations  for  the  various  levels  of  abstraction,  Thun,  if 
w wish  to  find  everything  on  Mammalia,  vs  sort*  or  search,  for 
everything  coded  "32"-  Such  a search  would  at  the  same  time 
deliver  ail  information  on  Carnivora  (327),  Canidae  (327li)» 

Doga  (327LS},  etc-  We  can  also  start  at  th-a  ether  end  of  the 
ladder  and  search  for  Labrador  Retrievers  by  searching  for 
everything  ceded  327L56,  Such  a search  would  not  give  us  any 
information  pertaining  to  the  higher  steps  of  the  ladder.  In 
short,  an  abstraction  ladder  makes  possible  e saner ic  Search, 
i.e,,  a search  for  classes  within  classes  without  requiring 
us  to  specify  in  advance  all  the  clusees  and  subclasses  con* 
tained  in  the  class  which  ie  the  object  of  the  search.  Further- 
more, if  we  prefile  the  materiel  by  class,  i.e.,  all  "3' a*  to- 
gether or  all  "32's"  together,  presumably  Jt-  is  only  necessary 
y~j  search  ouoh  prefiiad  groups  for  any  information  or  spy  item 
belonging  to  the  ssjtsa  abstraction  ladder.  To  be  sure,  thio 
possibility  follows  from  c predetermination  of  abstraction 
ladders  and  a coding  sy3tem  based  upon  such  prade termination. 

Suppose  that  v?  have  another  section  of  this  file  on  the 

it  « n ii 

rdbjectjhannsrn  and  Customs  and  that  under  this  class  we  have 
the  following  ebstrnr.tSon  ladder i 
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9 

Manners  and  Customs 

q; 

Rural  Sports 

926 

Hunting 

9286 

Hunting  Dors 

9206u 

Labrador  Retrievers 

It  is  immediately  obvious  that  all  material  on  Dors  will 
not  bo  found  by  searching  for  327o8,  nor  will  all  material  on 
Labrador  Retrievers  be  found  in  one  abstraction  laddor  or  pre- 
filed group  under  Chordata,  Mammalia,  or  Dogs. 

It  will  be  said  at  this  point  that  the  taxonomic  re- 
lations, the  relatione  of  class  inclusion  and  subordination, 
illustrated  by  the  abstraction  ladder  from  Chordata  to  Labrador 
Retrievers  are  more  "true"  or  more  "objective"  than  vhu  rela- 
tione exhibited  by  the  ladder  Mannors  end  Customs  to  labrador 
Retrievers.  In  a certain  snnsp.  we  admit  this  is  the  cnee - 
The  apparent  objectivity  of  abstraction  ladders  in  the  fields 
of  zoology  and  botany  is  a reflection  of  the  fact  that  in  these 
fields  there  a re  established  taxonomies  which  have  achieved 
general  acceptance.  We  must  still  Question  whether  or  not 
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one  ebs traction  ladder  as  more  ’'objective1'  o:  ’-true"  than 


. For  example  tnc  abstraction  ladder i 


another 


Manners  and  Customs 
Rural  Sports 


21; 


Hunting 

Hunt-in?,  Doga 

Labrador  Retrievers 

is  no  more  objective  or  true  than  the  abstraction  ladder i 

Canada 

Newfoundland 

Fauna 

Labrador  Retrievers 

It  will  be  said  here  that  we  have  used  different  princi- 
ples of  subordination j that  the  sense  in  which  Newfoundland 
is  subordinate  to  Canada  is  not  the  sense  in  which  Fauna  is 
subordinated  to  Labrador j or  the  sense  in  which  Rural  3porta 
is  subordinated  to  Manners  and  Customs.  Again,  vc  recognize 
that  this  vague  sense  of  different  kinds  of  subordination  has 
a basis  in  fact.  but  what  this  bssis  is  has  not  hitherto  been 
specified  in  the  literature  of  classification  and  information 
theory, 

Subordination  in  classification  systems  is  always  a re- 
lation of  class  inclusion  as  contrasted  with  the  relation  of 
logins!  conjunction  or  ccordins.ti.on  which  is  ih®  chsrsc t*ic 
relationship  of  elements  in  a coordinate  index.  An  abstraction 
ladder  from  Chordata,  to  Labrador  Retrievers  can  be  illustrated 
by  a set  of  circles  in  which  the  subordinate  circle  ia  in- 
cluded in  the  higher  or  more  generic  circle. 
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Newfoundland  can  be  pictured  as 


iC^iCil  CCnjunCtiOn  of  Fauna  and 
two  overlapping  circleot 


In  the  ladder  Chordata  fco  Labrador  Retriever,  the  class 
Mammals  is  included  in  the  class  cnorbaha.  e'inc?  there  ere 
no  Mammals  which  are  not  Chordata j but  there  are  carnivorous 
animals  which  are  not  Mammals.  For  example,  birds,  fish, 
and  soma  reptiles.  The  Carnivora,  as  an  order  of  the  erase 
Mammal a , is  a class  formed  by  the  logical  conjunction  of  the 

rl  flfi*  VfiVA1»a  r ^ 1 i. «.  *. 
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If  two  ideas  or  clauses  are  related  by  the  relation  of 
logical  conjunction  we  can  aet  up  !>■«  order  of  subordination 
in  either  direction  as  suits  our  purpose,  We  can  for  example, 
set  up  a ladder  in  which  the  largest  class  is  carnivorous 
animals  and  wa  can  subordinate  Mammals  to  carnivorous  animals, 
Just  ae  we  can  subordinate  Newfoundland  to  Faunaj  but  Carnivora, 
dsf  i nsd  £5  special  class  of  Mammals  having  certain  special 
characteristics  is  subordinate  to  Mammals  by  definition.  There 
is  no  question  of  objectivity  or  truth  here  but  only  a matter 
of  definition  or  the  meaning*  of  words,  Being  a mammal  Is  part 
of  the  definition  of  being  a Carnivora,  This  is  more  obvious, 
perhaps,  in  the  case  of  Newfoundland  FaunA,  which  by  the  very 
meaning  of  th6  words  is  recognized  fs  a subclass  of  the  class 
Faunae  The  certainty  and  assurance  vi  t-ft  which  we  subordinate 
Carnivora  to  Mammals  or  Newfoundland  Fauna  to  Fauna  derives 
from  our  acceptance  of  formal  definitions  and  not  from  any 
recognition  or  discovery  of  truth  or  true  relationships  in 
nature , 

A taxonomy  is  a system  of  definitions  which  fixes  the 
relationship  between  a set  of  terms  and,  hence,  of  a sot  of 
classes  denoted  by  the  terms.  Systematic  zoology  and  botany 
and  parts  of  chemistry  tire  taxonomic  sciences  because  their 
vocabularies  arc  fixed  by  such  definitions.  Now  the  extension 
of  the  taxonomic  method  to  science  in  general,  cr  to  the  field 


- tl 


of  in  telii  genes*  aastsne"  tr-a L.  tha  Vw0«i>ui»rio5  or  science 
or  Intelligence  constitute  a taxonomy  or  a system  of  abstraction 
laddsre.  This  assumption  we  can  state  without  qualification  is 

false. 
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PART  III 

In  developing  the  theoretical  discussion  presented  above, 
vre  examined  severe!  ;r.ajcr  classification  systems  to  determine 
how  the  subordination  of  classes  in  achieved  in  them.  The 
National  Advisory  Committee  fcr  Aeronautics  classification 
system  which  wag  studied  earlier  in  connection  with  the  prepara- 
tion of  one  of  our  reports  to  ASTI A,  is  net  a suitable  system 
for  our  present  purposes  since  it  is  admittedly  based  on  local 
and  arbitrary  ground  rules.  We  have,  therefore,  taken  our 
examples  from  systems  which  claim  to  be  based  or.  rational  arid 
objective  considerations,  namely  the  tis  S.  Patent  Office 

16 

Classification  and  Riles' s "Bibliographic  Classification”."'' 

We  might  have  chosen  examples  from  the  library  of  Congress 
system  or  the  Dewey  system  both  of  which  we  hare  subjected  to 
thorough,  examination;  but  the  Patent  Office  Classification 
seemed  particularly  appropriate  to  our  purposes  sinca  those  who 
ire  responsible  for  creating  and  using  it  make  a great  point 
of  the  inability  91  indexes  to  display  the  generic  relation- 
ships and  associations  required  in  patent  scart.hing.  Bliss's 
'^iblio«r-.'iphic  al  Classification"  has  been  chosen  because  it  Is 
the  latest  and,  presumably,  the  roost  '’scientific"  of  all 
library  classification  systems. 

In  both  systems  we  discovered  and  distinguished  three 
mathode  of  achieving  subordination  of  one  idea  to  another: 

^°A  Bibliographic  Classification,  by  Henry  Evelyn  Silas, 
n.  w * Wilson  Co.  a TTmEFTo r5r”TT? 5 2 ) * 


ta 

1 


the  semantic-  tho  topical,  and  the  taxonomic*  We  alee  cip= 

4 ^ - J , —..V. SJ U i. _ ^ a:  > > \ 
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can  only  be  found  In  tha  taxonomic  acianc*?  included  t s 
sections  of  over-all  classification  systems j and  that  the 
balance  of  a general  classification  system  like  the  Patent 
Office  system,,  or  the  Bliss  system  exhibited  only  semantic  and 
logical  subordination. 

Semantic  Subordinatiom 

As  the  name  Indicates,  semantic  subordination  la  ; .rely 
verbal  in  character  end  differs  from  alphabetical  indexing 
only  in  being  arranged  differently  on  e page.  Consider  for 
example,  the  following  eels  of  ter».*«  and  phrases  which  might 
be  found  in  any  alphabetical  index » 

functions,  Addition?,  of  aggregates 

t •*#»*•  < AM*.  4 

f U4iu^Av;:C|  vvn  waciuwmu 

Functions.  Differentiable 
Functions,  Discontinuous 
Functions,  Integrable 
Functions,  Symmetric 
Functions,  Types  of 
or 

Science 

Science , H La  Lory  of 


Science,  Philooophy  of 
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Science,  Principle®  and  Methods  of 
or 


V fives 


V*1?cb,  Cheek 

Valves,  Oate 


Valves,  Reducing 
Valves,  Seated 


If  «*e  m r thsSa  sets  of  wSi'iuo  w iwi c nits  parM  ox 


a el.eaaifi  ration  ayatam  hy  indentation  OH  a p*g* 

se  Mr.  Bliss  has  dont-,  we  get  the  foliowingt 
Types  or  functions: 


L ^ tAfl  rtf  • 4 i "tr<« 

•'tio*  - ©—  ’~-r  C-- -•  . • *•  — »*  « “»  ••■+ 

Continuous  functions 
Diife re n liable  functions 
Discontinuous  functions 
Integrable  functions 
Symmetric  functions 
or 


Science 


History  of  science 

Fhilcocphv  of  science 

Principle  ' and  methods  of  science 


or 


% 


■3 


Valves 


Chock  valves 


Reducing  valvea 
Seated  valves 

The  following  erem.nl*  nf  semantic  subordination  Is  taken 
from  the  Potent  Office  Cl^ssif io« lion ; Class  192 > "dutches 
and  ri>w~r~Stop  Control. K Under  this  classification,  there 
is  listed! 


u'/uwcnes 


30.5  Impact  delivery  type 

31  Automatic 

32  Manual  control 

lil  One  vxy  engaging 

lib  Multiple 

These  headings  can  be  rearranged  for  or  slphofcetical 
index  os  follows* 

Clutches 

Clutches  - Impact  delivery  type 
Clutches,  Automatic 
Clutches,  Automatic  - Manual  control 
Clutches,  Automatic  - One  way  engaging 
Clutches,  Multiple 

Since  thrt  Log’ lining  of  modern  It  brarianship,  exponents 
cf  classification  have  teen  able  to  convince  a great  many 


■*m*w  ■■'f'fl 
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people  that  the  indented  arrange r.iont  is  more  logical  than 
the  inverted;  whereas  examination  discloses  only  a difference 
in  en'  -.tie  or  physical  arrangement.  Mr.  Bliss  and  these  re- 
sponsible for  the  Patent  Office  Classification  share  a failure 
tc  recognise  that  classification;  to  the  extent  that  it  achieves 
subordination  by  semantic  means  (e.g.,  subordinates  "check 
valves"  to  "valves",  "discontinuous  functions"  to  "functions" 
or  automatic  clutches  to  clutches,  depends  upon  words  ana  not 
upon  &ny  logic  of  ideas  which  underlies  the  words.  ?h«t  5 ■ 
to  say,  the  boast  which  classifiers  make  of  having  achieved 
logical  order  as  opposed  to  verbal  or  alphabetical,  order  is 
empty  and  meaningless,  to  the  extant  that  they  use  semantic 
subordination. 

Topical,  Subordination  2 

The  second  way  classifier*  achieve  subordination  is  through 
topical  subdivision.  This  method  is  called  Across  classifica- 
tion" by  Mr.  Bliss  in  his  introduction  and  he  illustrates  it 
by  means  of  the  following  tables « 


Plants 

Insect® 

Birds 

Mammals 

Aquatic 

! 

L 

1 

i 

Terrestrial 

1 

l 

1 

! 

| 

Amphibious 

Xeric 

i 

_ . i 

*s 


1 

j 

1 
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Aquatic 

Land 

Amphibious 

Xeric 

- I — i 

Insects  j 

f 

1 

Birds 

i 

! 

i 

Plnnt.g 

1 j 

L ___  , 

»S~ 1 _ 

tuMiituriAa  j 

i | 

1 J 

1 

1 

L_ 

It  should  be  apparent  that  there  18  no  real  difference  between 
these  two  tables  and  that  it  is  no  more  logical  or  scientific 
to  subdivide  forms  of  life  by  habitat  than  to  subdivide  habitat 
by  forms  of  life.  Kr.  Blis3  realizes  thisj  hence,  his  use  of 
the  term  "cross  classification"  arid  hie  statement  that?  "Clasroo, 
or  eub-clanses.  of  the  same  grade,  or  order,  of  division  are 
termed  coordinate,  and  tire  principle  of  placing  there  ir  such 
order  is  coordination.  Subordination  and  coordination  ere  thus 

relative  to  division  and  gradation.  The  coordinate  aub-classea 
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cf  several  coordinate*  c.adocn  iha,  t>tj  coorQin&Led’’ . However, 
he  does  not  take  the  final  and  necessary  step  which  is  tha 
recognition  that  the  subordination  of  one  topic  to  another  is 
arbitrary  and  parochial  and  has  no  claim  to  logical  or  uni- 
versal significance. 

The  following  example  of  topical  subordination  is  taken 
Irom  the  Patent  Office  Classification  Cxnes  75,  "Metallurgy." 

1? 

Bliss,  ibid,,  p,  6. 


£ 


122 

- 34  - 
Alloy* 

136 

Aluminum 

l 

Copper 

liiQ 

Tin 

1U1 

Zinc 

lk 2 

Magnesium 

Ikl 

Silicon 

llifi 

NlCiCQA 

lh$ 

M \ / 

JLUO 

Silver 

Zinc 

147 

Magnesium 

148 

Silicon 

153 

Copper 

154 

Tin 

156 

Lead 

156.5 

Zinc 

1 C7 
• 

Zinc 

157.5 

Zinc 

Here  again,  it  ia  clear  that,  topical  subordination  is  really 
coordination.  There  is  no  sense  in  which  aluminum  i«  more 
generic  than  copper  or  copper  mom  generic  than  tin  or  sine. 

We  can  put  thia  same  observation  in.  stronger  language  by  noting 
that  it  is  nonsense  to  suppose  an  arrangement  on  a page  can 
make  copper  generic  to  tir.  or  tin  subordinate  to,  or  a sub- 
division of  copper  in  the  sense  that  carnivora  are  subordinate 
to  or  a subdivision  of  ma  mals  or  iodine  is  subordinate  to  or 
a subdivision  of  halides. 

Those  two  forma  of  relationship;  the  topical  and  semantic, 
constitute  overwhelming  proportion  of  most  classification  systems. 
Once  this  pr«nis9  is  established  the  conclusion  follows  that 
univei8Bi  classification  is  no  more  significant  than  a pattern 
of  printing  on  a pegs,,  and  has  no  logic  other  than  the  logic 


of  general  discourse 


Taxonomic  S u bo rdJ nation) 


All  general  classification  systems  which  include  sections 
on  botany,  zoology  and  chemistry  exhibit,  as  we  noted  in  Fart 
IX,  genuine  taxonomic  relations  of  one-way  subordination  and 
inclusion.  In  the  latent  Uffice  Classification  we  find  in- 
stances in  Clara  260,  “Chemistry,  Carbon  Compounds", 


2U1 

250 

2y2 


Azine 

Diazine 

Pyrimidine 


In  this  cane  as  in  our  previous  examples  drewn  from  the  field 
cf  zoology,  the  very  meaning  nf  the  vor^s,  determines  that 
Azins  includes  Diazines  and  include  ryrimiames. 

since  Bliss' a Bibliography o Classification  utilizes 
biological  laXutiOmj.es  for  his  class  "f”  Botany,  and  hi  a class 
nG"  Zoology,  there  is  no  need  to  labor  this  point  any  further. 


Although  fully  developed  abstraction  ladder®  dc  net 
exist  outside  of  the  sp-eial  tarcncmic  sciences,  in  all  field* 
of  science  w*  do  use  words  which  are  defined  in  terms  of  their 
relation  to  more  generic  words  cr  idmmu.  f-ct  that  in 
recant  months  we  nave  heard  little  about  sb* traction  ladders 
from  **r.  Perry  and  more  about  an  operation  known  *■  "semantic 
factoring"  is  no  doubt  attributable  to  hie  recognition  that 
evfcn  though  wa  cannot  create  truly  significant  abstraction 
ladders,  we  can  usually  consider  any  class  in  relation  to  a 
higher  class.  For  example,  a bomber  may  be  defined  as  a typ* 
of  airplane,  and  in  aome  informs tion  system*  it  might.  oe  worth 
while  to  index  any  material  cr,  bombers  unisr  both  headings, 
^bombers"  and  r;  airplanes".  Such  two-level  relationships  are 
rj'.'t  ladders  except  in  a sense  which  is  so  as  to  be 

trivial.  We  have  noted  in  our  empirical  worK  that  indexing 
on  two  levels  usually  makes  a good  deal  of  sense,  whereas  the 
attempt  to  go  beyond  two  levels  makes  very  74ttle  sense.  In 
one  instance,  for  example,  ws  dete.^dned  to  index  material  on 
pentodes  under  the  heading  "tubes",  out  it  would  have  seemed 
silly  to  use  the  next  higher  level  "electronic  devices".  We 
might  index  material  on  iodine  under  "halogsns"  but  not  under 
"chemicals",,  In  a lectuu  giver  at  the  School  of  library 


Uw  U i i 


Service 

sample 


. Culvjnhia  ii:ii  vei'o » ijy  in  lyo,  Mr,  Furry 
ot  summit ic  i'nc  tori  ng  4 i,hn  rylulluaship 


ur*-d 

Iwilwfcf  r> 
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terms  "weapon"  and  "mine" . He  could  not  oii'er  any  term  for 
the  next  hippier  or  lower  level  and  staged  that  semantic  factor- 
Inc  usually  involved  only  two  levels.  One  fact  which  may  ac- 
count for  th«  case  with  which  we  can  index  cn  two  levels,  or  our 
readiness  to  accept  oerumtic  factoring  a3  a two-level  process, 
is  the  standard  dictionary  practice  of  defining  something  by 
describing  it.  .*vs  a special  kind  of  the  next  hiphor  gemi3r 
Webster’s  Collegia  to  Dictionary  define:-,  "pistol"  ar  a certain 
kind  of  firearm:  it  defines  "firearm"  as  a certain  kind  of 
v-iespcr.j  and  it  defines  "weapon"  as  a certain  kind  of  instru- 
ment. 


But  our  problem  ho re  is  not  with  the  possibility  of 
defining  words  on  wo  or  more  levels,  but  with  estimating  the 
contribution  of  such  definitions  t.c  the  solution  of  the  .roblcm 
of  execs rivu  znchi  :«.-ofrf>reh  tir.;c«  if  in  the  vocabulary  of  ar.y  . 
system  of  information  the  number  of  semantic  factors  is  a small 
portion  of  the  tot'll  voc-ihul ary  and  or.«:h  term  in  this  vocabu- 
lary can  only  be  facto  red  on?  way,  i .1:  • , can  be  related  to  orly 
or  - of  the  semantic  factor::v  then  v?  would  nppr-  ci.-bly  reduce 
i;»  archinr  tine  by  restricting  ■ p.*rrc!iitig  for*  any  Hen  to  nn 
army  cf  t.-*j*. .*•  urvv-r  or.c  feei-nr.  but  r-  have  j:o  yea-on  to 
£;jv,j-r  til.*- 1.  1 1 1 i uitu  ,11  '-n  li'.lds  im  rvujVl.be  wide  ti>  held  - 


for  any  actual  inforait»ion  system.  V*  rtoofniM  that  pistols 
era  firearms  and  firearm*  are  weapons#  But  pistols  may,  with 
«b  much  logic , be  group*!  with  S&m  Brown  belts  ana  clothing 
under  the  general  class  "officers*  squipmeni"  as  with  minee, 
eicsdc  bc.v.bs  end  guided  missiles  under  the  general  class 
“weapons"* 

In  short , we  ucy  take  advantage  of  what  the  dictionary 
tails  us  about  the  meaning  of  words  to  make  our  indexing  sor® 
useful,  for  searches  at  various  levs  is  of  r,sr«r*.V  ty,  but  we 
cannot  utilise  such  definitions  to  divide  = system  of  in* 
formatter-  In  order  to  reduce  the  time  of  search*  Our  con- 
clusion hens  5S  with  all  of  our  conclusions  in  this  paper, 
has  an  empirical  as  well  as  a theoretical  basis* 

In  ths  paps-  vs  quoted  at  the  beginning  of  this  dis- 
cussion, end  in  another  paper  in  the  ease  volume , we  attempted 
to  find  in  the  idea  of  ’bategoricatior."  a kind  of  special 
grouping  of  ideas  or  semantic  factors  without  syatnnatio 
subdivisions  - a middle  ground  as  it  w .-v  between  classifica- 
tion syeioit®  and  i;w  straight  alphabe ideation  of  v-  index* 

Ws  are  now  certain  on  the  basis  of  effort-  extending  over  a 
year  anu  s>  half,  ana  supporting  evidence  fro?  one  of  our 
clients  vrao  #.ioo  attempted  to  categoric  ar.  extensive  vseabu* 
lary,  th/.t  categorization  is  not  practical  for  large  systems* 
Fcr  particular  and  loci?  purposes  involving  highly  specialised 


collections  of  information,  it  is  possible  to  establish  rsla*> 
tivaly  adequate  categories,  or  semantic  factor*,  *v“n  “h- 
atr&cticn  lgdaerr,  Bui  the  more  general  the  eastern  and  the 
more  general  its  use,  the  mure  difficult  it  becomes  to  set 
up  adequately  defined  and  ™*>r,uaily  exclusive  c&tagories. 

If  vq  were  concerned  with  indexing  all  materials  or.  a 
particular  disease  we  might  set  up  the  following  list  of  catfw 
gori.es i symptoms,  etiology,  treatment,  geographical  distribu- 
tion, prognosis,  economic  and  social  factors,  complications, 
age  groups,  or  any  others  which  suggest  the-  selves  j»s  generic 
interests  or  factors  under  which  wu  could  croup  our  vocabulary 
of  Indexing  terms.  But  v*>  doubt  the  possibility  of  devising 
a te  of  categories  or  semantic  factors  for  one  terms  used  in 
Indexing  the  subject  of  medicine,  the  field  of  science,  the 
interests  of  the  Department  of  Defer.se,  or  the  claims  of 
patents.  This  is  not  a counsel  of  despair,  but  rather  realism 
in  the  face  of  experience- 

In  a certain  sense,  this  paper  may  ne  regarded  is  a 
clearing  of  the  underbrush  before  beginning  construction. 

This  Aclaphor  in  sound  because  it  should  warn  us  that  under- 
brush. is  not  something  we  cor.  eliminate  or.ee  and  for  all-  Vie 
started  in  the  beginning  of  this  paper  with  certain  quotations 
from  Dr.  5hcra'3  work  and  certain  observations  which  seemed 
to  indicate  that  the  choking  underbrush  of  hicrarciilcai  ciassi 
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